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Abstract 

Message passing algorithms are popular in many combinatorial optimization problems. For example, 
experimental results show that survey propagation (a certain message passing algorithm) is effective in 
finding proper fe-colorings of random graphs in the near-threshold regime. In 1962 Gallager introduced 
the concept of Low Density Parity Check (LDPC) codes, and suggested a simple decoding algorithm 
based on message passing. In 1994 Alon and Kahale exhibited a coloring algorithm and proved its 
usefulness for finding a fc-coloring of graphs drawn from a certain planted-solution distribution over k- 
colorable graphs. In this work we show an interpretation of Alon and Kahale's coloring algorithm in light 
of Gallager's decoding algorithm, thus showing a connection between the two problems - coloring and 
decoding. This also provides a rigorous evidence for the usefulness of the message passing paradigm for 
the graph coloring problem. Our techniques can be applied to several other combinatorial optimization 
problems and networking-related issues. 



1 Introduction and Results 

A k-coloring f of a graph G = (V, E) is a mapping from the set of vertices V to {1, 2, k}. f is a proper 
coloring of G if for every edge (u,v) £ E, f(u) ^ f{v). In the graph coloring problem we are given a graph 
G = (V, E) and are asked to produce a proper fc-coloring / with a minimal possible k. The minimal value 
of k is called the chromatic number of the graph G, commonly denoted by The problem of properly 

fc-coloring a fc-colorable graph is notoriously hard. It is one of the most famous NP-Complete problems, and 
even approximating the chromatic number within an acceptable ratio is NP-hard [7]. 

Message passing algorithms are popular in many combinatorial optimization problems. For example, 
experimental results show that survey propagation (a certain message passing algorithm) is effective in 
finding proper fc-colorings of random graphs in the near-threshold regime [5_. In his seminal work [14j . 
Gallager in 1962 introduced the concept of Low Density Parity Check (LDPC) codes, and suggested a simple 
decoding algorithm based on message passing. 

In 1994 Alon and Kahale pQ introduced, in another innovative work, a spectral algorithm for coloring 
sparse ("low density") fc-colorable graphs, and showed that their algorithm works u>ftJ3 over a certain 
planted-solution distribution over fc-colorable graphs on n vertices. 



1 By writing whp we mean with high probability, i.e., with probability tending to 1 as n goes to infinity. 



1.1 Our Contribution 



In this paper we connect the two latter results. Specifically, we show that Alon and Kahale's coloring 
algorithm implicitly contains Gallager's decoding algorithm, thus asserting an interesting relation between 
the two problems - coloring and decoding. Theorem 11.11 makes this notion formal. Hopefully, following this 
insight, other heuristics that are useful in decoding algorithms can be applied to the coloring problem; we 
are not aware of previous works pinpointing a connection between the two problems. Our result also gives 
a rigorous analysis of a message passing algorithm in the context of the colorability problem, while up until 
now only experimental results showed the usefulness of the message passing paradigm. 

Our result works in the other direction as well. LDPC codes drew the attention of many researchers ever 
since introduced by Gallager in 1962. The problem of graph coloring can be viewed as a specific instance 
of the decoding problem - the codeword being a proper fc-coloring of the graph (assuming there is a single 
one). Therefore, the analysis of Gallager's algorithm that we provide here can be useful for the analysis 
of Gallager's algorithm on random LDPC codes. Specifically, the study of LDPC codes is concerned with 
two main problems. The first is designing "good" LDPC codes based on expander graphs and studying 
their decoding efficiency close to the code's rate (which usually depends on the expansion properties of the 
graph), for example [20J [16] . These works show that whenever the expander is "good", one can decode a noisy 
codeword of length n close to the code's rate, leaving at most o(n) errors. The second problem concerns the 
efficiency of decoding a noisy codeword with error-ratio some fraction of the code's rate, for example [2"2"lll7| . 
In these works it is shown that whenever the expander is "good" and the codeword doesn't have too many 
errors then typically one can reconstruct the entire codeword (for example, the rate-1/2 regular codes of 
Gallager can provably correct up to 5.17% errors, or the irregular codes in [17] with 6.27%). 

Our result concerns the second problem. In our setting, the input graph G (sampled from the planted 
distribution over fc-colorable graphs, to be defined shortly) on which we analyze Gallager's algorithm is 
typically not an expander, though it contains a large subgraph which is an expander. Loosely speaking, we 
are able to show that typically Gallager's algorithm converges "quickly" on G when starting from a noisy 
coloring which differs from a proper one on the color of say n/120 vertices to a proper fc-coloring of G (up to 
a small number of vertices which remain "undecided" and whose coloring can be completed efficiently) . Thus 
we are able to analyze Gallager's algorithm while relaxing the demand of expansion for the entire graph, 
and replacing it with a more modest requirement. It could be interesting to apply our techniques back to 
the analogous LDPC setting, and in particular to random LDPC codes similar to the ones introduced by 
Gallager himself. 

1.2 The Planted Graph Coloring Distribution 

The model we analyze was suggested by Kucera |15j as a model for generating random fc-colorable graphs 
with an arbitrary average vertex-degree, denoted throughout by G^^k- First, randomly partition the vertex 
set V = {1, ...,n} into k classes Vi, 14, of size n/k each. Then, for every i ^ j, include every possible edge 
connecting a vertex in Vi with a vertex in Vj with probability p = p(n). Throughout, ip* : V — > {1 . . . , k} 
denotes the planted coloring, where the corresponding graph will be clear from context. G^^l is also referred 
to as the planted distribution, and is the analog of the planted clique, planted bisection, and planted SAT 
distributions, studied e.g. in ® EJ Qj] . 
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@n p"fc * s similar to LDPC in some sense. Both constructions are based on random graphs. In codes, the 
received corrupted codeword provides noisy information on a single bit or on the parity of a small number of 
bits of the original codeword. In Gf^l the adjacency matrix of the graph contains noisy information about 
the planted coloring - embedded in eigenvectors corresponding to the (fc — 1) smallest eigenvalues. 

1.3 Our Result 

To avoid a cumbersome presentation we state the results for the case k = 3, and point out that the result is 
easily extended to any fixed k. We call a fc-coloring <p partial if some vertices in <p are UNASSIGNED (that 
is, are left uncolored). From here on we will refer to Gallager's decoding algorithm by Gallager and to Alon 
and Kahale's coloring algorithm by Alon — Kahale. 

Theorem 1.1. Fix e < and let G be a random graph in Q^^l, rip > Co, Co — Co(s) a sufficiently large 
constant. Then running Gallager on G starting from ip, some 3-coloring at distance at most en from if* , 
satisfies the following with probability 1 — e~ e (™P) 

1. Gallager(G, tp) converges to a partial coloring ip' after at most 0(log7i) iterations. 

2. Let Va = {v G V : <p'(v) — <p*(v)}, then \Va\ > (1 — e~ e (" p ))n. Furthermore, all vertices inV\VA are 

UNASSIGNED. 

3. G[V \ Va] can be properly colored (extending the proper coloring of G[Va\) in time O(n). 

Remark 1.2. In item\^ if np > Clogn for a sufficiently large constant C, then whp Va — V, namely all 
the vertices are properly colored, and item 3 becomes void. 

The algorithm Gallager and its adaptation to the coloring problem are given in Section[2]below. Alon — Kahale 
is also described in Section [2T4l for the sake of completeness. For simplicity, we identify by Gallager both the 
decoding algorithm and its adaptation to the colorability problem, while it will be clear from context which 
setting is regarded. 

Comparing our results with the coding setting, known results show that Gallager, applied to carefully 
designed LDPC codes, typically recovers the entire codeword (if the noisy codeword has error rate below 
the code's rate) or all but o(l)-fraction of it (when close to the rate). In our case, Gallager colors correctly 
("decodes") all but a constant fraction of the vertices, or more precisely, at least (1 — e~ B ^ npS) )n vertices are 
properly colored. This is optimal in some sense as whp there will be e~® tynp ^n vertices in G with degree at 
most k — 2, each such vertex can actually take more than one color while still respecting the planted partition. 
Furthermore, the vertices which are left UNASSIGNED (the ones which Gallager fails to "decode") induce a 
"simple" graph (in the sense that it can be exhaustively 3-colored while respecting the already-colored part). 
Nevertheless, in both the decoding setting and the coloring one it is shown that the quality of decoding 
(coloring) improves exponentially with the number of iterations. 

Our result is also comparable with the work of 9J where the Warning Propagation message passing 
algorithm was analyzed for the satisfiability problem. Though Warning Propagation cannot be viewed as 
an adaptation of Gallager's algorithm to the satisfiability problem, the two algorithms are similar in the 
sense that they both involve discrete and rather simple (natural) messages. The result in [9] is very similar 
in nature to the one we obtain, though the initial satisfying assignment can be a random one (due to the 
inherent break of symmetry that SAT possesses - variables can appear positively or negated). 
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I. 4 Structure of Paper 

The remainder of the paper is structured as follows. In Section [2] we present Gallager's decoding algorithm - 
Gal lager - and its adaptation to the coloring problem. We also present Alon and Kahale's coloring algorithm 
and show how it contains Gallager, thus justifying the title of this paper. In Section [3] we discuss some 
properties that a typical instance in G^^a possesses. These properties become handy when proving Theorem 

II. 11 in Section 12.51 Concluding remarks are given in Section [5] 

2 Gallager's Decoding Algorithm 

Before presenting Gallager we give a short introduction to linear codes and their graphical representation. 

2.1 Graphical Representation of Codes 

A linear code C of length n and dimension r is defined by a matrix H G {0, l} rx "; a vector c = (ci, . . . , c„) G 
{0, 1}™ is a codeword (c G C) if He = (mod 2). The matrix H can be viewed as the incidence matrix of some 
bipartite graph B[H] with n nodes on the left, {xi, . . . , x n }, and r nodes on the right, {Ci, . . . , C' r }. Every 
left-side node corresponds to a different bit of the message (we refer to Xi as a the i th variable), and every 
right-side node represents a constraint (we refer to Cj as the j th constraint). We add the edge (xi, Cj) if and 
only if Hij = 1. Every constraint Cj involves the variables in N(Cj) — {xi : (xi, Cj) G E (B[H])}, i.e. the 
neighbor set of Cj. The code C can now be defined equivalently in terms of the bipartite graph B[H] in the 
following way: c G {0, 1}" belongs to C if when assigning Xi = Cj, then for every constraint Cj the exclusive-or 
of the bits in Ng\m{Cj) is 0. If every variable appears in exactly s constraints, and every constraint involves 
exactly t variables, then the bipartite graph is called (s, t)-regular (and so is the corresponding code). If t 
and s are constants, that is - every constraint involves at most a constant number of variables, and every 
variable takes part in a constant number of constraints, then the code is called low-density (an equivalent 
requirement is that the check-matrix of the code is sparse) . 

2.2 Gallager's Algorithm 

Gallager's decoding algorithm is a message passing algorithm defined on the bipartite graph B[H], which is 
also referred to as the factor graph in the context of message passing algorithms. Gallager's algorithm is an 
example of hard decision decoding, which signifies the fact that at each step the messages are derived from 
local decisions of whether each bit is or 1, and this is all the information the message contains (as opposed 
to more detailed probabilistic information) . We note that Gallager also proposed a belief propagation type 
decoding algorithm, which uses a more complicated message set. 

There are two types of messages associated with every edge (xi, Cj) of B[H]; a message from a constraint 
Cj to a variable Xi and vice versa. Intuitively, we think of the message Xi — > Cj as some sort of a majority 
vote over the messages Cj/ Xi (for j ^ j'). Similarly, Cj sends Xi the "preferred" value for Xi - if the 
exclusive-or, ©, without Xi is b G {0, 1}, then to keep the constraint satisfied, Xi should take the value b. 

Formally, let r > be some fixed integer and a = (ai, a.%, a n ) the received codeword. Set Cj' b = {Cj> : 
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C , Xi = b A? ?j} tor b& {0,1}. 



X{ ► Cj 



ai , otherwise 

Cj -+Xi = © ie jv B|H1 ( Cj )\{ Il } x -> c i 



Figure 1: Gal lager's messages 

It is convenient to define a third message which is not sent during the algorithm, but is used to calculate 
the final decoding. For every define 



Bi = 



We are now ready to present Gal lager. 



6, \{C : C -» Xi = 6}| > r 
oti , otherwise 



Gallager(a, t): 

1. for every edge {xi, Cj) : set Xi — » Cj = a>i 

2. repeat until no message changed: 

(a) update in parallel all messages Cj — » x^. 

(b) update in parallel all messages xi — > Cj . 

3. return for every i, Xi = Bi. 



Figure 2: Gallager's algorithm 

The algorithm is allowed not to terminate (the messages may keep changing every iteration). If the 
algorithm does terminate, then we say that it converges. In practice, it is common to make an a-priori limit 
on the number of iterations, and return failure if the algorithm does not converge within the given limit. It 
should be noted that Gallager is often described with r being the size of the constraint minus 1. 



2.3 Gallager for Colorability 

Given a fc-colorable graph G one can define its factor graph B[G]. The constraints (right hand side nodes) 
are the edges of G, and the left-hand nodes, the variables, are the vertices. For G V(G), and Cj S E{G) 
we add the edge (xi,Cj) if Xi e Cj. The vector c G {1, fc}l y ' G ^ is a proper /c-coloring of the graph if 
the exclusive-or of each constraint (edge) is non-zero (where the exclusive-or of two integers is the bitwise 
exclusive-or of their binary representation) . The factor graph of the coloring problem is a special case of the 
LDPC factor graph in which every constraint involves exactly two variables. 

We suggest the following interpretation of the messages in Figure Q] for the coloring setting. Fix some 
constant r (Theorem 11.11 is proven with r = 1), and set as before C*' b = {Cj' : Cj' — > Xi — b A f ^ j}, 
b G {1, . . . , k}. We let argmin denote the index of the minimal elements of the given set. 
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Undecided, 3/, m s.t. I ^ m and 
Z, I = argmin t Cj' , 



1,1 




< t; 



Figure 3: Gallager's messages for colorability 



Though at first glance the messages may appear different than the ones in Figure [T] this is only due to 
the fact that the coloring setting is not a binary one. x t sends Cj the minority vote over the colors it receives 
from the other constrains (edges) it appears in (or Undecided if there weren't enough votes to begin with) . 
The constraint Cj sends the variable xi which color it must not take. To see the similarity to the coding 
setting, observe that a congruent set of messages for Gallager's algorithm would be for a variable to send 
the least popular bit amongst the messages it received, and for a constraint to send the value which the 
variable should not take. The constraint to variable message, Cj —> Xi, is the same in both settings since in 
the coloring problem the exclusive-or is taken over one item (every constraint - edge in this case - contains 
exactly two variables), and therefore it is the item itself. 

Gallager can now be used for the coloring problem, with the input a = (tp(xi), . . . , tp(x n )) being the (not 
necessarily proper) fc-coloring vector of the vertices, and the B^s defined by 



2.4 Alon and Kahale's coloring algorithm 

The seminal work of Alon and Kahale p] paved the road towards dealing with large constant-degree planted 
distributions. They present an algorithm that whp 3-colors a random graph in ^^3, where np > Cq and 
Co is a sufficiently large constant. Since our result refers to their algorithm we give a rough outline of it 
here, and refer the interested reader to [1] for the complete details. 

The algorithm is composed of three main steps. First using spectral techniques one typically obtains a 
fc-coloring that agrees with the planted one on many vertices (say 0.99n). Then follows a refining procedure 
(step 2,3) which terminates typically with a partial fc-coloring that coincides with the planted coloring on 
the colored vertices. This coloring typically colors all but e~ Q<Jlp ^n vertices. Finally, in step 4, the small 
graph induced by the uncolored vertices is exhaustively searched for a fc-coloring that extends the fc-coloring 
of the rest of the graph. 

2.5 Gallager meets Alon-Kahale 

We now show how Alon — Kahale actually runs Gallager. This observation is not made in pQ, nor the con- 
nection to message passing whatsoever. 

We first claim that one can unify the recoloring and unassignment steps of Alon — Kahale (steps 2 and 3) 
as follows. In the recoloring step, assign each vertex with the least popular color amongst its neighbors, or 
set it Undecided if it has less than r neighbors colored in some color other than its own (t is some fixed 
integer, say 2). Using the same arguments as in [1] one can prove that the algorithm performs the same 




l}\,\{Cj-.Cj 



Xi = m}\ < t 



(2.1) 



G 



Alon-Kahale(G,fc): 

step 1: spectral approximation. 

1. obtain an initial fc-coloring of the graph using spectral methods. 
step 2: recoloring procedure. 

2. for i = 1 to logn do: 

2. a for all v € V simultaneously color v with the least popular color amongst its neighbors. 
step 3: uncoloring procedure. 

3. while 3v s V with less than np/10 neighbors colored in some other color do: 

3. a uncolor v. 

step 4- Exhaustive Search. 

4. let U C V be the set of uncolored vertices. 

5. consider the graph G[U]. 

5. a if there exists a connected component of size at least logn - fail. 
5.b otherwise, exhaustively extend the coloring of V \ U to G[U]. 



Figure 4: Alon and Kahale's coloring algorithm 

with the unified recoloring-uncoloring scheme. It is then left simply to observe that the unified step of the 
new Alon — Kahale is exactly Gallager (with the messages as in Figure [3]) since the message Bi (|2.1|) which 
implies the color of Vi reads exactly this. 

3 Properties of a Random Instance 

In this section we introduce some properties of a typical graph in Q„^z- These properties will come in 
handy when proving Theorem 11.11 in the next section. Properties of similar flavor can be found in pQ along 
with complete proofs which arc omitted here due to space considerations. Given a subset U of vertices, e(U) 
denotes the number of edges spanned by the vertices of U. 

The first property we discuss is discrepancy. Specifically, a random graph whp will not contain a small 
yet unexpectedly dense subgraph. Formally, 

Proposition 3.1. Let G be a graph distributed according to Gf^p 1 ^ with np > Cq. Cq a sufficiently large 
constant. Then whp there exists no subgraph of G containing at most n/60 vertices whose average degree is 
at least np/15. 

The next property we discussed was introduced in pQ and plays a crucial role in the analysis of the 
algorithm. Loosely speaking, a vertex v is considered "safe" w.r.t. ip* if v has many neighbors from every 
color class of <p* other than its own, and these neighbors are also "safe" w.r.t. ip* . v is "safe" in the sense 
that if it converged to a wrong color (w.r.t. ip*) when the algorithm terminates, then it has many wrongly- 
colored neighbors w.r.t. ip* , and so do they. This avalanche effect of many wrongly-colored vertices is, under 
certain conditions, very unlikely to happen. Therefore, typically the "safe" vertices converge correctly. The 
following definition makes this notion formal. 

Definition 3.2. A set of vertices is called a core of G — (V,E), denoted core (G) if for every v S core(G): 
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• v has at least np/5 neighbors in core (G) n Vi for every i ^ (p*(v). 

• v has at most np/20 neighbors from V \ core (G), 

Proposition 3.3. Let G be a graph distributed according to with np > Go, Go a sufficiently large 

constant. Then whp there exists a core core (G) satisfying |core(G) | > (1 — e _e ( np ))n. 

The main idea of the proof is to prove that the following procedure typically outputs a big core. Set 
X to be the set of vertices having at least np/A neighbors in G in each color class of if* other than its own. 
Then, repeatedly, delete from X any vertex that has less than np/5 neighbors in X in some color class other 
than its own or more than np/20 neighbors not in X. This procedure clearly defines a core. To see why this 
core is typically large - observe that to begin with very few vertices are eliminated from the core (since all 
but e~ e ( np )n vertices have degree, say, 0.99np/3 in every color class other then their own). If too many 
vertices were removed in the iterative step then a small but dense subgraph exists (as every removed vertex 
contributes at least np/20 edges to that subgraph). Proposition 13.11 bounds the probability of the latter 
occurring. 

Next we characterize the structure of the graph induced by the non-core vertices (the non-core graph). 

Proposition 3.4. Let G be a graph distributed according to Q^^t with n P > Co, Go a sufficiently large 
constant. Then whp every connected component in the non-core graph contains O(logn) vertices. 

Proposition 13. 4| whose complete proof is given in [1] , will not suffice to prove Theorem ll.l| and we need 
a further characterization of the non-core graph. Using similar techniques to those used in the proof of 
Proposition 13.41 we prove: 

Proposition 3.5. Let G be a graph distributed according to Gfip^ with np > Go, Go a sufficiently large 
constant. Then with probability 1 — e~ e (™ p ) there exists no cycle in the non-core graph. 

Remark 3.6. Observe that Provosition 13.51 is true only with a constant high probability. Ln fact, whp the 
connected components of the non-core graph are trees with an additional edge (unicycles). While one may 
be able to prove convergence o/Gallager on such a structure as well, it will be considerably more complicated 
and technical and not too insightful to the problem that we discuss in this paper. For more details the reader 
is referred to JVjj. 

4 Proof of Theorem 11.11 

A graph G is said to be typical in Q^^l if Propositions 13.11 1 3.3113. 41 and 13.51 hold for it. The discussion in 
Section [3] guarantees that whp G is typical. Therefore, to prove Theorem 1 1.1 1 it suffices to consider a typical 
G. In the propositions below we assume that G is such and do not state anew each time. 

We will split the proof into two parts corresponding to the three items of Theorem 11.11 First we show 
that the messages sent between variables and constraints spanned by core (G) converge to (p* when starting 
with a not "too noisy" coloring. Second we show that the messages sent in the non-core part of the graph 
converge to ip* or to the Undecided message. This combined with Proposition 13 . 51 implies that the coloring 
of the Undecided vertices can be completed in linear time (list coloring of a tree) . 
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Proposition 4.1. Let if be a 3-coloring of G that disagree with if* on at most n/120 vertices. Then whp 
all messages of Gallager(y), 1) spanned by the vertices of core (G) converge after at most O(logn) iterations. 
Furthermore, the message P>i of i>, G core (G) equals <p*(vi). 

Proof. For every vertex u we define </?«-+e to be the value of the message u — > e that the variable (vertex) u 
sends the constraint (edge) e = (u, v) in iteration i of Gallager (according to the message defined in Figured]). 
Let Ui be the set of core vertices for which <p$e) ^ <P*( U )- It suffices to prove that |£/j| < |?7j_i|/2 (if this 
is true, then after logn iterations Ui ogn = 0). Observe that by our assumption on ip - \Uq\ < n/60. By 
contradiction, assume that not in very iteration \Ui\ < |[/j_i|/2, and let j be the first iteration violating 
the inequality - \Uj\ > |[/j_i|/2. Consider a vertex u G Uj. If ^?u-+e 7^ j t nen there must be at 

least np/20 vertices w amongst it's neighbors s.t. <pw-*e / ( y 3 *( u; )- To see this, observe that any vertex in 
core (G) has at most np/20 neighbors outside core (G) (by definition of core), and hence if it has at most 
np/20 + np/20 = up/10 wrongly colored neighbors, then it has at least np/A — np/10 > np/10 vertices 
from the each of the correct colors, and therefore <pu— >e agrees with <p*(u) (contradicting our assumption). 
For conclusion, let U = Uj U Uj—\. Then every u G Uj has at least np/10 neighbors in Uj~\. Assuming 
\Uj\ > \Uj-\\/2, the average degree in G[U] is at least n P/ x ^\ u ^ > g^Zl^Hll > n p/l5. Recalling that 
\U\ < 2 • n/120 < n/60 contradicts Proposition O □ 

Proposition 4.2. Let irj be the iteration in which ipu°l e = <P*(u) for every u G core (G) (if'ule is the message 
u — > e in the i th iteration). Then after at most t = Iq + O(logn) iterations, all messages of Gallager(<£, 1) 
converge. Furthermore, for every u G V(G), ^i— » e *s either ip*(u) or Undecided. 

Proof. We first note that it suffices to show that all messages leaving variables will converge as stated, since 
all messages leaving the constraints are just repeaters of messages received at the constraint in the previous 
iteration. Let Go = G[V \ core(G)] and consider the factor graph that Go induces (which is also a tree); 
for a tree edge (w, e) in the factor graph we define level(u, e) to be r if r is the maximal length of a path 
between u and a leaf in the factor graph from which the edge (it, e) is removed. By induction on the level r 
we prove that in iteration r + ig all vertices at level r transmit either Undecided or their planted coloring. 
Observe that by the assumption in the proposition, at all iterations i > io, all messages leaving core (G) into 
Go state the correct color. 

The base case is an edge (u, e) at level 0. If level(u, e) = then (pv^Ie is either f*{v) or Undecided 
since the only messages that take part in the calculation are the ones coming from core (G) (if any), and they 
agree with tp*(u). Now consider an edge (u, e) at level r, and consider iteration io + r. The key observation 
is that the level of all edges (v,e'), e' — (u,v) is strictly smaller than that of (it, e) (since when removing 
(it, e), the way from it to all the leaves passes through every (v, e')). Now apply the induction hypothesis 
and repeat the same argument of the induction step. Finally, since by Proposition 13.41 the level of an edge 
is at most O(logn), we have that after O(logn) iterations (from io) Gallager converges to either the proper 
coloring or to Undecided. □ 



This completes the proof of Theorem [O 
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5 Discussion 



Message passing algorithms are a trendy and promising area of research in many interesting and fundamental 
optimization problems, attracting researchers from different disciplines, e.g. statistical physics, coding theory 
and more. Some experimental results show the effectiveness of such algorithms for instances that seem "hard" 
for many other heuristics [5] . Alas, not many of the message passing algorithms were rigorously analyzed due 
to the inherent complexity of this task. Nevertheless, in this work we give a rigorous analysis of a well-known 
message passing algorithm in the context of the colorability problem, pinpointing an interesting connection 
between this problem and decoding algorithms, and showing the possible value of message passing heuristics 
for colorability. Our result also works in the other direction, that is, by introducing new analytical tools to 
analyze Gal lager in the colorability setting we suggest the same tools for establishing rigorous results in the 
ever developing study of LDPC codes. 

Our results also extend naturally to other partition problems where spectral techniques provide a first 
approximation and then an iterative algorithm is used to find a proper solution [21 I10[ 0] . It would be 
interesting to see whether one can adjust Gallager to these settings and prove similar results to ours. 

Another interesting question is to study the counterpart of code-rate in the colorability problem. Specifi- 
cally, what is the "farthest" coloring from the planted one, from which one can prove convergence of Gallager? 
A random 3-coloring will be at distance roughly 2n/3 from the planted one. Now if every vertex with d 
neighbors has exactly d/3 neighbors in every color class (which is what happens in expectation when starting 
from a random coloring), then this is a fixed point of the system and Gallager stays "stuck". Therefore it 
is safe to guess 2n/3 as an upper bound for that distance. In this work our techniques enable us to prove 
rt/120 as a lower bound, and although no effort was made to optimize the constants, we do not believe to 
be able to reach the aforementioned upper bound. It will be interesting to close this gap. 
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